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CURRENT  PROGRESS 


During  the  past  -fiscal  vear, 
support  activities  in  -four  related  areas  o-f  associative 

programming:  1  >  implementation  o-f  the  ASPROL  compiler,  2'. 

implementation  of  the  FIRST,  NEXT ,  PREVIOUS,  TRUNC  and  TRUNCA 

}  J  J 

•code  -functions  and  I/O  staci  routines'.  3),  implementation  of  the 
associative  database  primitives.'  and  4'.  analysis  of  new  network 
designs  for  interprocessor  communications  in  associ atiwe 
processors . 


A  cor v  of  the  ASRRDL  compiler  has  been  given  to  Goodyear 
Aerospace  for  evaluation  for  use  with  their  ASRRO  SI  ML’  computer. 
It  is  felt  that  this  compiler  can  be  augmented  with  rule  based 
constructs  that  will  provide  a  high  performance  expert  system 
capatilit-.  .  Many  of  GAC’s  customers  such  as  Grumman  Aerospace. 
NQSC  and  NSWC  are  inte-ested  m  such  a  capability.  Preliminary 
studies  performed  under  a  seperate  grant  indicate  that  speeds  of 
about  60 '1*0  inferences  oe^  second  are  obtainable. 


The  following  papers  were  produced 

DATA  STRUCTURES  FOR  ASSOCIATIVE  COMPUTING  'submitted  for  publication 
AN  ASSOCIATIVE  MODEL  OF  DATA  'submitted  for  publication) 

ON  PERFORMANCE  OF  SW— BANYAN  NETWORKS  'submitted  for  publication* 


The  ASPROL  Programming  Language. 

|  The  ASPROL,  language  developed  last  vear  and  described  in 

\  last  years  report  was  implemented  on  the  KSU  VAX  780  research 

t 

\  computer.  The  comoiler  converts  ASRRDL  statements  into  a  high 

l 

I  level  intermediate  form  similar  to  the  form  described  in  the 
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origin?:  proposal  (p.  6'.  This  intermediate  code  can  be 
concerted  into  assemfclv  language  code  -for  anv  bit  serial  SIMD 
compute-.  The  current  implementation  produces  code  -for  the 
STARAN  E  end  the  ASPRO  computers.  (Attached  is  a  copy  o*f  the 
ASPPOL  users  guide'. 

Primitive  ACODE  and  I/O  Stacl  Operators 

The  FIRST,  NEXT.  PREVIOUS,  TRUNC  and  TPUNCA  primitive  ACODE 
ope*"ato*"s  developed  -for  manipulating  data  structures  during  the 
•first  */e?r  o-f  this  effort  were  implemented  and  tested  on  the 
STARAN  E.  FIRST  returns  the  acode  of  the  ne::t  lower  level  in  the 
hierarch  .  .  NEXT  returns  the  ne-.t  acode  at  the  current  le-'el  c  + 
the  hie** arch  -.  PREVIOUS  returns  the  previous  acode  at  the 
current  le  el.  TRUNC  returns  the  acode  of  the  ne*'t  higher  level 
in  the  hiera-chv.  TRUNCA  returns  the  acode  of  the  too  level  of 
hi er ar oh  .  . 

Routines  which  simulate  the  I'D  stacl  ooerations  described 
in  the  original  proposal  were  written.  COLLAPSE  eliminates  the 
unused  words  in  the  I'D  stacl  (a  reserved  page  of  the  STARAN  E 
a*"**a  memo***/’  compacting  the  data  into  a  contiguous  bloc!:. 
EXPAND  *-e  e*"ses  the  orocess.  It  causes  the  contiguous  bloc!  of 
data  at  the  bottom:  of  the  stacl  to  be  expanded  to  words  which  are 
idle.  The  COLLAPSE  and  EXPAND  commands  simulate  hardware  commands 
that  facilitate  run  time  memory  allocation  in  associative 
processor  = . 

An  Associative  Database 

An  initial  design  for  an  associative  database  was  developed 
and  the  primitive  data  base  operators  were  implemented  on  the 


ST ARAN  E. 


Th  i  s 


database  is  designed  -for  natural  language  and 


artificial  intelligence  apDl i cati ons. 

Interconnection  Networks 

A  new  method  for  determining  the  number  of  permutations 
realized  by  SW-Banvan  networks  was  developed. 


PLANS 


The  major  emphasis  in  the  ne-:t  year  will  be  to  integrate  the 
structure  code  mechanisms  into  the  ASPRQL  compiler.  As  reported 
above,  the  primitive  functions  have  been  designed  and 
implemented,  but  they  have  not  been  integrated  into  the  ASPRQL 
language.  The  first  step  will  be  to  design  the  formal  language 
mechanism  which  will  be  used  in  ASPROL  and  then  the  language  will 
be  expanded  to  include  them. 

The  utility  of  the  structure  code  concepts  in  associative 
computers  and  high  level  languages  will  be  studies.  Not  only  do 
structure  codes  allow  dynamic  memory  allocation.  they  promise  to 
provide  a  common  data  organication  which  will  allow  such  diverse 
languages  as  LISP,  FORTRAN.  PROLOG  and  ASPPOL  to  communicate 
with  each  other. 

Some  effort  will  be  spent  on  the  initial  development  of  a 

theory  of  associative  computing  to  explain  its  relationship  with 

rule  based  languages.  - - 
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APPENDIX  A 


ASPROL  SYNTAX  PRIMER 
Version  2.00 


ASPROL  is  a  high  level  language  designed  specifically 
for  SIMD  parallel  processors  such  as  the  STARAN-E  and  ASPRO. 
This  primer  is  written  to  aid  users  in  writing  ASPROL  codes. 
It  specifies  the  correct  syntax  for  writing  ASPROL  programs . 
It  is  not  meant  to  be  a  user  guide  for  writing  STARAN-E  pro¬ 
grams.  There  is  a  class  offered  by  the  math  department: 
Parallel  Programming  and  a  series  of  seminar  about  Parallel 
programming.  For  those  interested  in  programming  the  STARAN- 
E,  please  consult  with  Dr.  Jerry  Potter. 


For  those  who  want  to  have .  a  copy  of  this  primer,  use 
this  command  on  your  command  level  on  Vax780  or  Vax750  : 


%  lpr  -1  /user/parallel/asprol/primer 


I  would  appreciate  your  help  debugging  this  compiler. 
Please  send  me  mail  about  any  bugs  that  you  may  find.  Please 
be  advised  that  ASPROL  will  quit  compiling  as  soon  as  the 
first  syntax  error  is  detected.  Please  don't  be  discourage 
by  this  little  inconvenience  ! 


I.  How  to  compile: 

To  compile  your  program  : 

%  asprol  -s  filename. asp  (for  STARAN-E  code) 
or 

%  asprol  -a  filename. asp  (for  ASPRO  code) 
Either  will  create  3  files  : 


1.  filename. 1st  : 

a  readable  program  listing  with  intermediate  code. 
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2.  filename. iob  : 

An  encoded  intermediate  file  which  is  expanded  in 
pass  two  to  assembly  language. 

3.  Option  -s  will  produce  :  filename. apl 

Assembler  code  which  can  be  assembled  by  the  STARAN- 
E  assembler. 

Option  -a  will  produce  :  filename. asm 

Assembler  code  which  can  be  assembled  by  the  ASPRO 

assembler. 


II.  Syntax  ; 


[  Program  identification  ] 
becfaration 
Variable  association 
body 
[  end;  ] 


Note ; 


An  "linclude  filename"  may  be  placed  anywhere  in  the 
source  program  for  one  level  of  file  indirection.  If 
the  include  file  ends  with  an  "endasm;"  statement,  it 
must  be  followed  by  at  least  two  blank  lines. 


A.  Program  Identification: 

option  1:  Main  Program_name 

-  Generate  DFs  and  DC  in  assembler. 

-  Generate  ENTRY'S  for  DCs. 

option  2:  Subroutine  Subroutine_name 

-  Generate  DFs  and  ENTRYs  for  DC. 

option  3:  none 

-  Generate  DFs  and  DC  in  assembler. 

-  Generate  ENTRYs  for  DCs  but  no  boiler 

plate . 


example  : 


main  program_name 

variables  association 
declarations 
body 
end ; 


subroutine  routine_name 
variable  association 
declaration 
body 
end ; 


Declaration  :  Declaration  should  be  specified  in  the 
following  order  ! 


1.  def ine (var, constant) ; 
def log (var, constant ) ; 

2.  def var 

3.  int  scalar  varl,  var2, 
int  parallel  varl,  var2, 

real  scalar  varl,  var2,  . ..; 
real  parallel  varl, 

index  scalar  varl,  var2,  . 
index  parallel  varl,  var2, 

logical  scalar  varl,  var2,  ...? 
logical  parallel  varl,  var2,  .  ..; 


1.  def ine/def log 
define : 

syntax  :  define  (var,  constant) ; 

var  :  regular  variable  name. 

constant  :  any  constant  value  (including  a 
previous  define) . 

example  :  define  (ten,  10); 

def ine  (decade,  ten); 
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def log : 

Deflog  is  used  for  defining  a  logical 
constant.  In  ASPROL,  this  definition  is  treated  as 
a  scalar  constant.  The  only  different  between 
define  and  deflog  is  in  its  data  type.  Define  is 
an  integer  scalar,  while  deflcg  is  a  define 
logical  constant  (a  special  data  type) . 


syntax 

var 

constant 

example 


deflog  (var,  constant); 
regular  variable  name, 
either  0  or  1 . 
deflog (true ,  1 ) ; 


2.  defvar 

If  defvar  is  present  in  front  of  an  int,  real 
or  logical  declaration,  the  declared  variables 
will  be  stored  in  the  ASPROL  symbol  table  and  be 
treated  as  regular  variables,  but  ASPROL  will  not 
cause  any  memory  allocation  macros  to  be 
generated.  This  option  is  provided  to  allow  users 
to  override  the  ASPROL  compiler  memory  allocation 
so  that  they  can  allocate  memory  manually  at 
assembly  time.  The  programmer  can  specify  his/her 
memory  layout  by  using  "asmcode"  statement,  or  by 
editing  the  '.apl'  file  before  assembling  it. 

example  ;  defvar  int  parallel  A:8[$],  C[$]; 


3.1  int,  real,  index,  logical 

syntax ;  variable_name : f ie ld_width [dimension] 


variable_name  : 

-  letters  :  A  -  Z,  a  -  z  (no  distinction) 

1-9 

-  Any  variable  name  should  begin  with  a 
letter.  Any  variable  name  longer  than  6 
characters  will  be  truncated. 


example  :  progl,  false,  P12345. 
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3.2  field_width  : 

int  :  2-32  bits  (scalar  &  parallel) 

(default  16  bits) . 

real  :  32  bits  (scalar  &  parallel) 

(default) . 

index  :  1  bit  scalar  &  parallel  . 
logical  :  1  bit  scalar  &  parallel  . 

Note : 

All  scalar  variable  are  stored  in  32  bit  words  re¬ 
gardless  of  declared  size. 


3.3  dimension: 

The  maximum'  limit  for  the  dimension  depends 
on  the  data  type.  For  int,  real,  and  logical  data 
types,  the  maximum  limit  for  the  dimension  is  3. 
The  first  dimension  for  the  parallel  mode  should 
always  be  a  The  index  data  type  is  always  1 
level  deep. 


Note  : 

Indexing  of  dimensioned  arrays  ranges  from  0  to 
size  -  1. 


example  : 

int  scalar  A,  B[32],  C: 8 [ 10, 10, 10] ; 
int  parallel  A[$],  B[$,10,10],  C:32[$], 
D:32[$,5,10] ; 

real  scalar  A,  B[10],  C[10,10,10]; 
real  parallel  A[$],  B[$,10],  C[$,10,10); 

index  scalar  A; 
index  parallel  A[$]; 

logical  scalar  A; 

logical  parallel  A[$],  B($,10],  C[$,10,10], 

XX [ $ ] ; 


c. 


Variable  Association : 


This  section  tells  ASPROL  which  variables  are 
associated  with  which  variables.  All  variables  used  in 
this  association  should  be  declared  in  the  previous 
declaration . 


syntax  :  associate  varl,  var2,  var3,  ...  with  varX; 

where  :  varl,  var2,...  are  parallel  mode  variable. 
varX  is  a  logical  parallel  variable. 


example  :  associate  A[$],  B[$],  C[$],  D[$]  with  XX [ $ ] ; 


D.  Body  : 

1.  If  statement  : 
if  (logjpar_exp)  then 

body 

(else 

body 
•  ] 

endif ; 

2.  For  statement  : 
for  index_var  in  ( log_par_exp) 

body 

endfor  index_var; 

3.  Get  statement: 
get  index_var  in  ( log_par_exp) 


body 

endget  index_var; 


r. 
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Next  statement: 

next  index_var  in  ( log_par_exp) 
body 

endnext  index_var; 

While  statement: 

while  index_var  in  ( log_par_exp) 
body 

endwhile  index; 

Any  statement: 

any  ( log_par_exp) 
body 
endany ; 

First  statement: 
first 

initialization § 
loop  statement 

note  : 

Only  simple  assignment  statements  and  initializa¬ 
tion  are  allowed  in  the  'first'  statement.  The 
'loop'  statement  should  follow  right  after  the 
'first'  statement. 


Loop  statement  : 


loop 

body 

until  (scalar  condition) 
endloop; 


note  : 

There  may  be  any  number  of  'until'  statements  in  a 
'loop  -endloop'  body. 

Allocate  statement: 

allocate  index  in  index_variable 
body 

endal locate  index; 


note : 

index_variable  is  index_variable  association, 
index  is  an  index  variable. 


asmcode  -  endasm  statement: 

asmcode  (must  be  typed  in  the  first  column) 
Staran-E  assemble  language  instructions, 
endasm;  (must  be  typed  in  the  first  column) 


note : 

All  statements  between  asmcode  and  endasm 
should  exactly  follow  all  rules  for  Staran-E 
assemble  language.  All  ASPROL  variables  that  are 
keywords  in  the  STARAN-E  assembly  language  should 
be  prefixed  with  a  '$  '  character. 


w 


example  : 

An  x  in  ASPROL  becomes  $X  in  STARAN-E 
assembly  language.  Similarly,  y  becomes  $Y 
and  r6  becomes  $R6. 

For  more  STARAN-E  keywords,  please  consult  with 
the  STARAN-E  assemble  language  manual. 


stop  statement  : 

The  'Stop'  statement  is  used  to  stop  the 
execution  of  the  program.  It  can  be  used  anywhere 
in  the  program. 

syntax  :  stop; 


return  statement  : 

The  'return '  statement  is  used  in  the 
subroutine  module  only.  It  is  used  to  return 
control  of  the  execution  to  the  instruction  after 
the  call  statement  in  the  calling  module. 

syntax  :  return; 


call  statement  ; 

syntax  :  call  subroutine_name ; 


setscope  statement  ; 

setscope  log_par_exp 
body 

endsetscope ; 


The  setscope  statement  is  used  to  set  the  M 
register  temporarily  with  the  log_par_exp  result. 
The  statements  within  the  setscope  -  endsetscope 
only  will  be  affected  by  the  new  M  register 
setting.  After  the  endsetscope  statement,  the  M 
register  will  be  restored  to  the  value  before  the 
setscope  statement  is  used. 


unary  function  calls  : 

1.  MINDEX  : 

syntax  :  MINDEX (par_var) 

2.  MAXDEX  : 

syntax  :  MAXDEX (par_var) 
3-  MINVAL (par_var) 

syntax  :  MINVAL (par_var) 
4.  MAXVAL (par_var) 

syntax  :  MAXVAL (pa r_var) 


MINDEX  will  return  a  index  parallel  variable  that 
has  set  the  responder  bit  which  has  the  minimum 
value  on  the  par_var  array. 

MAXDEX  will  return  a  index  parallel  variable  that 
has  set  the  responder  bit  which  has  the  maximum 
value  on  the  par_var  array. 

MINVAL  will  return  a  scalar  temporary  variable 
that  has  the  minimum  value  of  the  par_var  array. 

MAXVAL  will  return  a  scalar  temporary  variable 
that  has  the  maximum  value  of  the  par_var  array. 


indirect  address  statement  : 

syntax  :  @ (scalar_exp) : type : length [dimension] 

There  are  2  different  ways  to  use  the  indirect 
address  statement  : 

1.  scalar  result 

syntax  : 

@ (scalar_exp) : type : length [ index_par_var ] 

This  statement  will  return  a  scalar  temporary 
variable  which  has  the  value  in  the  array 
that  is  pointed  to  by  the  result  of 
scalar_exp  and  indexed  by  the  index_par_var . 
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example  :  @ (A) : int : 8 [xx] 
where  :  A  is  int  scalar  variable, 
xx  is  index _ par_var. 


2.  parallel  array  result 

syntax  :  @ (scalar_exp) : type : length [ $] 

This  statement  will  return  the  array  that  is 
pointed  to  by  the  result  of  scalar_exp. 

example  :  @ (A) : int : 8 [ $] 

where  :  A  is  int  scalar  variable. 


17.  assignment  statement  : 

log_par_var  =  log_par_exp; 
var  =  expressions; 
note  : 

All  data  types  should  have  the  same  type  for  the 
left  and  right  hand  side. 


arithmetic  operators: 

+  :  addition 
-  :  subtraction 
*  :  multiplication 
/  :  division 


relational  operators: 


< 

9 

.  It 

> 

9 

.gt 

<  - 

9 

.  le 

>  = 

9 

•ge 

— s 

9 

.eq 

1  =: 

9 

•  ne 

less  than 

greater  than 

less  than  or  equal 

greater  than  or  equal 

equal 

not  equal 


83 


logical  operators: 

!  ,  .not.  :  negation 

|  j  ,  .or.  :  or 
&&  ,  .and.:  and 

,  .xor.:  exclusive  or 


note  : 

If  you  use  parallel  variable,  you  should  use  $  in 
the  assignment  expression. 

ex:  varl[$]  =  var2[$]  +  var3[$]  +  5; 


NOTATIONS: 

(..1 

log_par_exp 

log_par_var 

index_par_var 


III.  Reserved  keywords: 

These  following  identifiers  are  reserved  for  use 


as  keywords 

and  may  not  be 

used  otherwise: 

int 

real 

scalar 

parallel 

index 

logical 

any 

allocate 

associate 

define 

def log 

defvar 

during 

end 

endallocate 

endany 

endfor 

endget 

endif 

endif nany 

endloop 

endnext 

endsetscope 

endwhile 

else 

elsenany 

for 

first 

get 

in 

include 

if 

ifnany 

loop 

main 

maxdex 

maxval 

mindex 

minval 

next 

procedure 

return 

setscope 

stop 

subroutine 

then 

thenany 

until 

whi  le 

with 

♦include 

The  above  keywords  which  are  not  currently 
implemented  by  this  compiler  are  reserved  for  future 
use.  Some  keywords  may  be  changed  for  further 
refinement. 


:  optional 

:  logical  parallel  expression 
:  logical  parallel  variable 
:  index  parallel  variable 


***  end  *** 


1.0  Introduction 


1.1  Associative  Super  Computing 

Associative  Super  Computing  is  a  model  of  computation,  where 
each  datum  has  its  own  dedicated  processor.  Associative  computing 
exploits  massive  -fine  grain  parallelism  in  a  natural  wav. 
Massively  parallel  associative  computers  with  16k  processors  have 
been  built,  eliminating  the  need  -for  time  shanna  a  single 
central  processing  unit  with  a  multitude  o-f  data  elements  thus 
avoiding  the  classic  memory  -  CPU  bottleneck  Associative 
computing  uses  massive  parallel  searching  in  place  o-f  address 
calculation,  reducing  programming  complexity.  This  paper 

describes  a  method  -for  implementing  data  structures  in  the 
associative  computing  model  which  is  applicable  to  all  types  o-f 
high  level  languages. 

1 . 2  Background 

The  concept  o-f  associativity  has  been  present  in  computer 
science  -for  many  years  (Jacfs.19^1  and  Findler,  1  )  .  The  most 

prevalent  realization  is  associative  triples  in  AI.  The 
association  list  and  ASSOC  -function  in  the  LISP  1  anguage  are  of 
course  implemented  in  conventional  hardware.  Content  addressable 
memories  are  manufactured  but  are  limited  in  use  due  to  their 
relatively  high  cost.  Associative  computers  were  first  deve-loned 
at  Goodvear  Aerospace  in  the  earlv  197,.*s  (Batcher.  I0-7"7).  C. 
Foster’s  boot  (1°76)  describes  the  basic  components  of  elementary 
associative  computers. 

The  first  higher  level  1 anguages  described  that  made  use  of 


the  parallel  nature  of  associative  computers  were  extensions  of 
FORTRAN  (Rotter,  1982)  and  PASCAL  (Reeves.  1°85'.  However,  more 
recent  languages  have  been  described  which  are  based  on  an 
associative  model  o-f  computation  (e.g.  Potter.  1°8'7'.  Inherent 
in  these  languages  is  the  concept  that  each  association  has  its 
own  dedicated  processor  and  that  computation  is  ef-fected  by 
repeatedlv  selecting  associations  to  be  processed.  Since  there 
is  no  -formal  mechanism  in  these  languages  to  enable  the 
specification  of  a  data  structure.  associative  data  structures 
had  to  be  implemented  at  the  assembly  language  level  (For 
example.  Rotter  .  1°87«  F'eed,l°85  and  Fot t er . 1 °85 > .  This  paper 
expands  on  the  concepts  used  for  assembls'  1  anguage  data 
structures  building  a  completely  general  hierarchy  of  data 
structures  which  can  be  used  in  anv  higher  order  language. 

2.0  Associative  F’rogramming 
2 . 1  Bac (ground 

The  t  hr  List  of  the  associative  computing  model  can  be  best 
e  plained  bv  analyzing  the  fundamental  components  of  a  program. 
In  this  section,  a  program  is  considered  as  a  message  between 
programmer  and  computer.  The  messaae  contains  two  major  types  of 
information.  the  procedural  component  and  the  identification 
component.  The  procedural  part  specifies  the  operations  to  be 
performed  and  the  order  in  which  the-.-  are  to  be  executed.  The 
l dent i f l c a t l cn  component  of  a  program  selects  the  data  to  be 
operated  on  bv  the  procedural  component.  Associative  computina 
uses  associative  addressina  for  the  identification  component. 

The  structure  of  data  provides  information.  Thus  in  the 
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record  shown  in  Figure  2.1.  the  number  50  can  be  recognised  as 
the  AGE  attribute  of  the  patient  JONES  not  bv  anv  inherent 
property  o-f  its  own.  but  bs'  the  fact  that  it  is  in  the  second 
(AGE)  position  of  the  record.  College  te:ts  on  data  structures 
often  describe  information  theory  as  developed  bv  Shannon L 1 °4° 3 
and  others  as  part  of  the  theoretical  bad  ground  for  data 
structures.  The  knowledge  that  the  second  element  of  the  data 
structure  contains  AGE  data  is  " l nf ormat l on "  in  the  classical 
sense  that  it  dispels  "uncertainty"  about  the  meaning  of  the 
number  50. 


PATIENT 


BLOOD 
PRESSURE 

Figure  2. 1  -  A  Simple  Data  Structure 

The  positional  information  content  of  a  data  structure  is 
established  bv  two  mappings.  The  first  mapping  is  between  the 
problem  data  and  the  logical  data  structure  based  on  the 
algorithm.  The  sec ond  mapping  is  between  the  logical  data 
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structure  and  the  physical  or g an l c at l on  of  the  computer  memory. 
These  mappings  are  established  bv  the  programmer  and  are  often 
the  most  crucial  aspect  of  program  development. 

When  a  program  is  viewed  as  a  message  between  a  programmer 
and  a  computer,  as  illustrated  in  Figure  2.2,  it  contains  two 
basic  types  of  information.  First,  the  program  directs  the 
computer  in  the  procedural  execution  of  the  fundamental  step'--  of 


the  algorithm.  Second,  the  program  embodies  the  positional 
information  of  the  data  structure  bv  fetching  and  delivering  to 
the  central  processing  unit  the  appropriate  data  at  the 
appropriate  time.  That  is,  if  the  algorithm  is  calculating 
hospital  insurance  premiums  based  on  age,  the  program  must 
deliver  the  aqe  datum  not  the  blood  pressure  datum  t.o  the  CPU. 
Since  this  data  delivery  tasL  involves  the  mappina  relationships 
mentioned  above,  it  can  be  modeled  bv  an  addressing  function  <AF) 
component  of  the  program. 
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A  Conventional  Program  Message 


The  addressing  -function  of  every  program  is  influenced  by 
three  factors:  1)  the  mapping  between  the  problem  data  and  the 

logical  data  structure  (called  the  logical  mapping),  2)  the 
mapping  between  the  logical  data  structure  and  the  physical  data 
structure  in  the  computer  memorv  (called  the  physical  mapping), 
and  3)  the  mapping  between  the  physical  data  structure  and  the 
CPU  as  a  result  of  the  fact  that  the  CPU  must  be  time  shared 
among  the  multiple  data  records  in  a  file. 

In  the  simplest  conceptual i r at l on ,  a  different  addressing 
function  is  required  for  fetching  each  individual  piece  of  data 
reauired  bv  an  algorithm.  However,  these  simple  addressing 
functions  are  combined  into  larger  more  comprehensive  and 
cample:.  functions  using  looping  and  address  modification 
(indexing)  techniques.  The  loop  construct,  for  example,  is  used 
ex tensi vel y  to  time  share  the  CPU  among  the  manv  identical 
records  of  a  file.  An  important  aspect  of  selecting  a  data 
structure  for  a  sequential  computer  is  to  pi  cl  one  which  allows 
the  addressing  functions  to  be  efficiently  folded  so  that  the 
loop  construct  can  be  used. 

2.2  Associative  Programming  is  Easy 

Since  associative  computers  reduce  the  complexity  of  addres 
functions  without  recursion  and  without  limiting  the  data 
structure,  they  are  easier  to  program  than  conventional  computers 
in  three  wavs.  First,  since  every  data  record  has  its  own 

dedicated  processor.  the  need  for  a  "time  sharing"  factor  i n  the 
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address  -function  is  eliminated.  Second.  the  physical  mapping 
component  of  the  address  function  is  replaced  bv  parallel 
(associative'  searching.  Finally.  as  described  in  Section  7.0 
the  1 oa ical  mapping  relationship  is  stored  associatively  as 
Structure  Codes  with  the  data  elements  eliminating  the  need  for 
run  time  calculations. 

For  example.  in  Figure  2.7.  the  logical  portion  of  the 
address  function  consisting  of  the  matri::  row  and  column  indices 
are  stored  with  the  data  elements.  Since  the  data  structure  codes 
are  dependent  only  on  the  logical  mapping,  the  programming  tasl 
is  reduced  to  1)  directing  the  computer  in  the  sequential 
execution  of  the  fundamental  steps  of  the  algorithm  and  2)  the 
manipulation  of  the  logical  data  structure  codes.  The  artifacts 
of  time  sharing  the  CPU  and  the  physical  sequential  or gan l c at l on 
of  memory  are  eliminated. 
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FIGURE  2.3  —  An  Associative  Program  Message 

3.0  Associative  Data  Structures 
3 .  1  Pact  ground 

Arrays  are  the  canonical  -forms  o-f  data  structures.  Their 
address  -functions  -form  a  natural  hierarchy  o-f  complexity. 
Scalars  are  rero  dimensional  arrays.  They  are  represented  b-y  the 
class  o-f  address  -functions  consisting  only  o-f  constants.  The 
class  o-f  address  -functions  -for  one  dimensional  arrays  consist  o-f 
constants  plus  one  variable.  Two  dimensional  arravs  have  two 
variable  address  -functions,  etc.  The  most  common  example  o-f 
address  -functions  -for  arrays,  are  the  row-major  and  column-major 
ordering  -functions  generated  automatically  -for  indexed  arrays  by 
most  high  order  languages  such  as  FORTRAN  and  F'ASCAL. 

To  date.  content  addressable  memories  ^CAM)  and  associative 
computers  (AC)  have  been  used  almost  exclusively  -for  storing  and 
retrieving  simple  objects.  An  object  is  often  defined  as  a 
location-value  pair.  Accordingly,  an  associative  object  contains 
two  parts,  a  data  element  (or  value)  and  the  associated  value  of 
the  logical  addressing  function  (location)  or-  Data  Structure 
Code.  The  name  of  an  object  serves  as  its  constant  address 
function.  As  shown  in  Figure  3.1,  simple  scalar  variables  are 
synonymous  with  attribute  value  pairs  in  associative  programming. 
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col  or  ! b 1 ue 
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Figure  3.  1  -  Simple  Scalar  Structure  Codes 

3.2  Simultaneous  Multiple  Data  Organ i z at l ons 

Since  in  the  most  general  case,  the  value  of  an  object  can 
be  o-f  any  composition  (Elson,l0_73  p.  68).  associative  triples 

can  also  be  accommodated  bv  constant  address  -functions.  Note 

that  three  different  sets  o-f  structure  codes  are  illustrated  in 

Figure  3.2.  one  -for  each  part  o-f  the  associative  triple.  Since 

all  memory  locations  are  searched  in  parallel.  their  is  no  a 

priori  reason  to  select  one  set  of  structure  codes  over  an, 
other.  In  fact  if  there  are  n  components  in  an  object  (i.e. 
values  and  codes)  there  are 

n-1 

\  n 

/  C 

1  =  1 

constant  address  functions.  all  of  which  can  be  used  at  the 
programmers  discretion,  intermingled  in  any  order  without  any 
need  for  reordering.  This  is  impossible  in  any  sequent l ona 1 
computer  since  in  a  sequential  computer,  the  data  structures  must 
be  sorted  to  be  efficiently  accessed  and  data  can  be  organized 
into  only  one  ordering  at  a  time. 

3.3  Data  Organization  Man i pul  at l on . 

The  structure  codes  are  discussed  as  if  they  were  uni que 
d  a  t  a  items.  In  r  ea  1  l  t  v  they  are  not  .  They  are  .  us  t  life  t  f  >t 


Q 


other  data  items  in  an  associative  object  in  that  they  can  be 


searched  -for  and  manipulated.  Structure  codes  are  unique  only  in 
‘  hat  they  contain  structural  information  on  how  one  problem  data 
element  rel ates  logically  to  the  other  problem  data  elements. 
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Figure  3.2  -  Comple::  Scalar  Structure  Codes 

3.4  Generalized  Structure  Codes 

One  dimensional  arrays  can  be  stored  in  CAM’s  and  AC’s  bv 
using  a  straight  -forward  extension  of  scalar  structure  codes.  The 
structure  code  consists  of  the  object  name  (the  constant  portion 
of  the  address  function)  and  the  position  of  the  value  in  the 
construct  (the  -variable  portion  of  the  address  function).  The 
variable  component  for  one  dimensional  arravs  is  simplv  the 
ordinal  position  of  the  data  element  in  the  array.  Thus,  for 
example,  the  one  dimensional  object  A  =  (15432)  would  have 

the  structure  code  shown  in  Fiaure  3.7'. 
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Figure  3.3  -  A  One  Dimensional  Array 


The  structure  code  -for  two  dimensional  arrays  is  a  natural 
extension  o-f  one  dimensional  arrays  as  shown  in  Figure  3.4.  The 
extension  o-f  structure  codes  to  higher  dimensional  arrays  is 
obvious.  The  composition  and  manipulation  o-f  these  canonical 
array  structure  codes  to  male  structure  codes  -for  complex 
compound  data  structure  is  considered  next. 
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4.0  Examples 

4.1  Associative  Data  Structure  References 


One  dimensional  arrays  are  loaical  data  structures  which  are 


natural  -for  use  with  several  common  problem  data  structures  such 
as  vectors,  lists  and  strings.  Two  dimensional  arrays  are 
logical  data  structures  which  are  natural  -for  dealing  with 
matrices  and  imagery.  The  mapping  -from  these  problem  data 
structures  to  the  logical  data  structure  is  the  identity  mapping. 
Consequently,  tor  ease  o-f  reading,  where  no  contusion  can  arise, 
the  terms  vector  and  matri::  will  be  used  interchangeably  tor  one 
dimensional  and  two  dimensional  arrays  respectively. 

It  is  not  uncommon  to  consider  matrices  as  collections  ot 
vectors.  Thus  it  the  constant  portion  ot  '  he  structure  code 
shown  in  Figure  3.4  is  moditied  to  include  "row  position."  The 
constant  address  " &  1"  is  shared  by  two  values  representing  the 
vector  (5  3)  and  "&  2"  represents  (7  6).  Similarly,  it  the 
constant  portion  is  moditied  to  include  "column  position"  instead 
ot  "row  position,"  "&  1"  represents  (5  7>“*  and  "B  2" 

represents  (3  6)_1. 

An  important  property  ot  structure  codes  is  the  ability  to 
reorganize  them  as  illustrated  above.  The  "."  operator  will  be 
used  to  indicate  the  basis  code  grouping  and  can  be  thought  ot  as 
a  concatenat  i  on  operator.  The  symbol,  is  used  as  a  place 

holder.  Thus  the  code  B.l.'-'  represents  the  vector  <5  7).  h  .  .  Z 

represents  (  7  6)_1,  etc. 

4.2  Data  Structure  Code  Manipulation 

The  concept  ot  combining  data  structures  to  term  new  data 
structures  is  common  in  some  languages  such  a  LISP.  For  example, 
lists  can  be  grouped  together  to  torm  lists  ot  lists,  etc.  This 
can  be  done  because  ot  the  generalized  method  ot  data  storage  tor 


1  1  sts 


However 


this  capability  can  not  be  extended  to  other 


types  of  data  structures  such  as  aravs  because  they  use  an 

addressing  access  -function  not  a  content  addressable  access 
•function.  In  associative  computing,  the  ability  to  create  new 
data  structures  -form  existing  data  structures  at  run  time  is 
possi b 1 e . 

In  order  to  describe  how  the  structure  codes  -for  two 
arbitrary  data  structures  can  be  combined  to  generate  the 

structure  codes  -for  a  combined  data  structure.  several 
definitions  are  necessary.  Let  DSj  be  a  data  structure  of 
dimension  r  with  address  function  A  j  .  Then  Aj  =  aj0.ajl..ajr 

represents  the  r+1  components  of  the  structure  code.  By 

convention,  the  Oth  component  is  the  constant  portion  which  is 
the  name  of  the  data  structure.  Let  Aj(m>  stand  for  the  structure 
code  of  A _i  for  the  mth  element  of  DSj.  Let  01  denote  the 

constant  value  0,  0=  denote  0.0,  03  denote  0.0.0,  etc.  Then 
0 r’  denotes  the  constant  zero  structure  code  for  a  function  with 
n  components.  Similarly,  let  A^Ix)  denote  the  first  (left 
most)  n  components  of  a  structure  code.  The  depth  of  a  component 
is  equivalent  to  the  number  of  components  to  its  left. 

Then  if  D31  is  the  complex  data  structure  obtained  by 
inserting  data  structure  DS)  with  dimension  s,  as  the  mth  element 
cf  DSj  with  dimension  r,  at  depth  d,  the  address  function  A1  for 
DS1  has  dimension  d+s,  and  is  given  by 

A 1  ( : :  '  =  A  O***—-"  for  ::  1  =  m 


A 1  ( : :  >  =  A  ,(::).  A)  (  v )  for  :-:=m  for  all  y  in  DSD  . 

The  data  structure  insertion  operation  is  denoted  by: 
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.... 


receiving_data_structure!  !  Cel  ement ,  depthhriserted_data_strucUire. 
I -f  a  comple:  structure  is  to  be  built  bv  a  number  of  insertions, 
they  may  occur  in  any  order,  i.e.  if  ml  1 =  m2,  then 

( A1  ! I [ml , d 1 D  Af  >  :!Cm2.d2D  Aj  =  (A1  ::Cm2,d23  A,'  IlCml.dll  At 
Figure  4.1  gives  an  example.  DSa  is  an  "empty”  vector  with 
address  function  Aa  =  <1  2).  DSb  and  DSc  are  both  matrices  with 
the  same  address  function  Ab=Ac=<l.l.  1.2,  2.1.  2.2'.  The 
composition  A= (Aa  IIC1.13  Ab )  I1C2.11  Ac  is  shown.  Clearly, 
arbitrarily  comple::  hierarchical  data  structures  can  be  composed 
from  the  basic  canonical  forms. 

This  approach  is  completely  general  .  Lists  are  simpW  a 
special  case  since  thev  are  "vectors"  whose  elements  are  atoms  or 
other  lists.  Address  function  composition  can  be  applied  to  list 
structure  codes  to  generate  the  structure  codes  for  any  comple:: 
nested  list.*  Figure  4.2  illustrates  the  structure  codes  for  a 
list. 


*  Structure  codes  for  lists  have  been  developed  separately  and 
described  elsewhere  (Potter , 19B3  and  Reed,l°85).  The  thrust  of 
this  paper  is  to  develop  a  comprehensive  approach  to  addressing 
functions  and  illustrate  their  generality.  The  reader  is  re-fere d 
to  the  above  references  for  details  on  programming  using 
structure  codes. 
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dual  approach  to  referencing  strings  is  a  natural  artifact  of 


using  associative  addressing  techniques.  The  string  as  a  whole 
can  be  accessed  by  the  structure  code  S."’  while  the  nth  character 
in  the  string  can  be  accessed  bv  S.n.  Note  that  this  capability 
is  due  to  the  parallel  associative  implementation  of  structure 
codes  and  does  not  require  multiple  variable  declarations  or 
equ i v  a 1 en c  es . 
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Figure  4.3  -  A  String 


4.4  Associative  Stacl  and  Queues 

Other  commonly  used  data  structures.  such  as,  stacfs. 
queues ,  and  1  ini ed  storage  can  also  be  handled  in  the  associative 
model.  Stacls  and  queues  are  simply  variable  length  vectors.  A 
stall  push  is  accomplished  bv  adding  a  new  (larger)  ordinal 
position  to  the  vector.  A  pop  is  simply  the  selection  of  the 
largest  element  of  the  vector  and  its  removal.  Queue  and  1  ini  ed 
lists  can  likewise  be  easily  implemented.  However,  it  should  be 
emphasized  that  these  data  structures  are  artifacts  of 
conventional  sequential  structures,  and  that  if  the  data  to  be 
steeled  or-  queued  is  stored  in  associative  memor  .  with  a  time 
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tag.  the  need  -for  these  structures  is  eliminated. 

5. 'I*  Conclusions 

This  paper  has  presented  a  unified  approach  tor  representing 
arbitrarilv  complex  data  structures  in  content  addressable 
memories  and  associative  computers.  This  approach  to  data 
structures  in  associative  computers  has  the  advantages  of  1) 
automatical lv  extracting  fine  grain  parallelism,  2)  eliminating 
much  of  the  complexity  of  the  non-al gor i thmi t i c  address 
computation  in  program  development,  3)  allowing  multiple  data 
structures  to  be  associated  with  each  datum.  4)  allowing  the  data 
structures  themselves  to  be  modified,  and  5)  allowing  information 
exchange  between  vastly  different  program  languages  such  as  LISF', 
PROLOG.  OF  S3.  FORTRAN  and  PASCAL. 

Some  areas  for  future  research  are: 

1)  defining  arithmetic  operations  on  complex  data  structures 
as  a  natural  extension  of  element  by  element  arithmetic  of 
vectors  and  matrices, 

2)  the  utilization  of  multiple  distinct  structure  codes  in 
the  same  datum.  In  general .  there  can  be  a  different 
structure  code  for  every  logical  hierarchical  data  structure 
to  which  the  datum  belongs.  This  aspect  mav  be  part :  c:"l  ar  1  y 
useful  tor  semantic  networl =  and  frames  in  A1  applications. 

3.*  the  development  of  universal  operators  1  o<  the 
manipulations  of  structure  codes.  For  example,  the  operator 
"root"  will  generate  the  structure  code  for  the  root  of  a 
tree  from  the  structure  code  of  any  of  its  nodes  (bee 
Rot  ter.! .  and 
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4)  the  investigation  of  mathematical  properties  of 

addressing  -functions  and  structures  codes. 
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